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Abstract. In formal language theory, one of the most fundamental tools, known as pumping 
lemmas, is extremely useful for regular and context-free languages. However, there are natural 
properties for which the pumping lemmas are of little use. One of such examples concerns a notion 
, of advice, which depends only on the size of an underlying input. A standard pumping lemma 

' encounters difhculty in proving that a given language is not regular in the presence of advice. We 

, develop its substitution, called a swapping lemma for regular languages, to demonstrate the non- 

regularity of a target language with advice. For context-free languages, we also present a similar 



^ , form of swapping lemma, which serves as a technical tool to show that certain languages are not 

context-free with advice. 
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^ ; 1 Motivation and Overview 



Since their creation in the 1940's, formal languages and grammars have found many applications in, for 
instance, programming languages and artificial intelligence. A rapid development of the Internet, in particular, 
may treat one-way finite automata as a meaningful model of hand-held communication devices with a small 
amount of fast cache memory, which can process incoming information streamlined through a communication 
' channel. Such types of automata are closely associated with the classical notions of regular languages and 

. context-free languages, which were also classified respectively as type 3 and type 2 by Chomsky. These notions, 

which are foundations to formal language theory, have been a key subject of many core undergraduate curricula 
in computer science. The beauty of these notions comes from their simplicity and applicability. 
OO ' One of the most useful tools in formal language theory is so-called pumping lemmas 1 for regular and 

, context-free languages. Most undergraduate textbooks in this classic theory describe these lemmas (and their 

variants) as a powerful but fundamental tool. A standard methodology of proving the non-regularity of a 
target language, for instance, is to apply the pumping lemma to the language, deriving a desired contradiction 
if we begin with assuming the regularity of the language (see, e.g., |4j for its argument). For certain natural 
, properties of languages, however, the pumping lemmas are not in the most useful form and we thus need to 

^ ' develop another form of lemmas to prove those properties. A typical example concerns "advised computation." 

5^ , One way of enhancing the power of machine's language recognition is to provide a piece of advice, which 

is a string depending only on the length of input strings, to the original input strings. Karp and Lipton [5] 
formulated a mechanism of such advice for polynomial time-bounded computation. Inspired by the work of 
Karp and Lipton, Damm and Holzer [3] and Tadaki, Yamakami, and Lin [5] discussed the computational 
power of finite automata when advice is provided as supplemental information to their single read-only input 
tapes. See Section [2] for the formal definition of automata that take advice. As in [6], we use the notation 
REG/n to denote the family of languages recognized by deterministic finite automata if advice of size exactly 
n is provided in parallel to an input of size n. Similarly, we define CFL/n for the language family characterized 
by nondeterministic pushdown automata with such advice. 

As is known, the advised family REG/n is quite different from REG, the family of regular languages. 
Typical context-sensitive languages, such as L^eq = {a"'b"c" \ n > 0}, naturally fall into this advised family 
REG/n. On the contrary, certain deterministic context-free languages, such as Equal — {w G {0, 1}* | 
#o(w^) — #i(w^)} and GT = {w G {0, 1}* | #o(it;) > #i(w)}, where #a{w) denotes the number of a's in w, 
do not belong to REG/n. How can we prove this fact? Now, let us try to prove that, for example, Equal is 
not in REG/n using the pumping lemma for regular languages. Assume otherwise that Equal is characterized 
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by a certain regular language L with advice strings over an alphabet V. Now, we apply the pumping lemma 

by picking a pumping-lcmma constant m and then choosing an appropriate string w (together with an advice 
string) in L of length at least m. Using its decomposition w = xyz, the pumping lemma pumps this chosen 
string w with advice given in parallel to w, generating a new series of strings of the form xy^z. These pumped 
strings must belong to L; however, arc they still in Equal with the appropriate advice strings? At this point, 
we encounter a serious flaw in our argument. The pumping process unwisely pumps the original string as 
well as the valid advice string. Since the pumping lemma forces the size of pumped strings to change, their 
associated pumped advice might no longer be "valid" advice for Equal. Therefore, we cannot conclude that 
the pumped strings are actually in Equal with advice. To avoid this pitfall, we need to develop a new lemma, 
which keeps advice valid before and after the application of the lemma. 

In this paper, we shall present such a desired lemma, which we refer to as the swapping lemma, encompassing 
an essential nature of regular languages. In many cases, this new lemma is as powerful as the pumping lemma 
is. As examples, we shall later demonstrate, by a direct application of the swapping lemma, that the context- 
free language Pal = {ww^^ \ w G {0,1}*} (evcn-lcngth palindromes), where is w in reverse, and the 
aforementioned languages Equal and GT cannot be in REG/n (and therefore, they are not in REG). The last 
two examples show a separation of DCFL, the family of deterministic context-free languages, from the advised 
class REG/n. This immediately yields the class separation DCFL/n 7^ REG/n, which has not been known so 
far. Our proof of the swapping lemma for regular languages is considerably simple and can be obtained from 
a direct application of the pigeonhole principle. 

Likewise, we also introduce a similar form of swapping lemma for context-free languages to deal with the 
non- membership to the advised family CFL/n. With help of this swapping lemma, as an example, we prove 
that the language Dup = {ww \ w G {0,1}*} (duplicating strings) is not in CFL/n (and therefore not in 
CFL, the family of context-free languages). Another (slightly contrived) example is the language Equals 
consisting of all strings w over an alphabet of 6 symbols together with a special separator # for which each 
symbol except # appears the same number of times in w. Since Equals is in the complement of CFL, denoted 
co-CFL, we obtain a strong separation between CFL/n and co-CFL/n; in other words, CFL/n is not closed 
under complementation. Our proof of the swapping lemma for context-free languages is quite different from a 
standard proof of the pumping lemma for context-free languages. Rather than using context-free grammars, 
our proof deals with a certain restricted form of nondeterministic pushdown automata to track down their 
behaviors in terms of transitions of first-in last-out stack contents. 

The main purposes of this paper are summarized as follows: (i) to introduce the two swapping lemmas for 
regular and context-free languages, (ii) to give their proofs by exploiting certain structural properties of finite 
automata, and (iii) to demonstrate the strong separations between DCFL/n and REG/n and between CFL/n 
and co-CFL/n. 

We hope that the results of this paper should contribute to a fundamental progress of formal language 
theory and revive fresh interest in basic notions of regular languages and context-free languages. 

2 Notions and Notation 

The natural numbers are nonnegative integers and we write N to denote the set of all such numbers. For any 
two integers m, n with m < n, the notation [m, n]z stands for the integer interval {m, m + l,m + 2, . . . , n}. 

An alphabet is a nonempty finite set and our alphabet is denoted by either E or F. A string over an 
alphabet S is a series of symbols from S. In particular, the empty string is always denoted A. The notation 
S* expresses the set of all strings over S. The length of a string w, denoted \w\, is the total number of symbols 
in w. For each length n G N, we write S" (resp., S-") for the set of all strings over S of length exactly n 
(resp., at most n). For any non-empty string w and any number i G [0, \w\]z, prefi{w) denotes the first i 
symbols of w; namely, the substring s of w such that \s\ = i and sx = w for a certain string x. In particular, 
prefo{w) = A and pref\^\{w) = w. Similarly, let sufi{w) be the last i symbols of w. For any string x of 
length n and two arbitrary indices i,j G [1, n]z with i < j, the notation middij{x) denotes the string obtained 
from X by deleting the first i symbols and the last n — j symbols of x; thus, middij {x) contains exactly j — i 
symbols taken from x. As a special case, middi^i{x) expresses the ith symbol of x. It always holds that 
X = prefi{x)middijsufn-j{x). 

For any language L over S, the complement T,* —L oi L is denoted L whenever S is clear from the context. 
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The complement of a family C of languages is the collection of all languages whose complements are in C. We 
use the conventional notation co-C to describe the complement of C. 

We denote by REG the family of regular languages. Similarly, the notation CFL represents the family 
of context-free languages. We also use the notation DCFL for the deterministic context-free language family. 
We assume the reader's familiarity with fundamental mechanisms of one-tape one-head finite-state automata 
and their variant, pushdown automata (see, e.g., [4] for their formal definitions). Concerning these machine 
models, REG, CFL, and DCFL can be characterized by deterministic automata (or dfa's), nondeterministic 
pushdown automata (or npda's), and deterministic pushdown automata (or dpda's), respectively. 

Now, let us briefly state the notion of advice in the form given in [5], which is slightly different from [3]. 
First, we explain how to provide advice strings to finite automata using a "track" notation. Consider a finite 
automaton with one scanning head moving on a read-only input tape, which consists of tape cells indexed 
with integers. For simplicity, the leftmost symbol of any input string is always placed in the cell indexed 1. 
Now, we split the tape into two tracks. The upper track contains an original input x given to the machine 
and the lower track carries a piece of advice, which is a string w (over a possibly different alphabet) of the 
length |a;|. More precisely, the tape contains n tape cells consisting of the string [ Z ] = [ II ][ II ] ' ' ■ [ IZ ]' 
where x = xiX2 ■ ■ ■ x„ and w = ai<T2 • ■ • fn, in such a way that each ith cell contains the symbol [ ]• The 
machine takes advantages of this advice w to determine whether it accepts the input x or not. To deal with 
all different lengths, advice is generally given as a form of functior0 h (which is called an advice function) 
mapping N to F*, where F is another alphabet, such that \h{n)\ — n for any length n G N. 

The succinct notation REG/n, given in [5], denotes the collection of all languages L over an alphabet S 
such that there are an advice function h and a dfa M for which, for all strings x €Yj* , x & L \E M accepts 
[ ]. Since each dfa M characterizes a certain regular language, say, L', the above definition of REG/n 
can be made machine-independent by replacing the dfa M by the regular language L': let L 6 REG/n if 
there exist an advice function h and another language L' in REG for which, for all strings x € , x € L 
iff [ ,,(f°„|) ] e L' . Similarly, we can define the advised families CFL/n and DCFL/n from CFL and DCFL, 
respectively. 

To help the reader grasp the concept of advice, we shall see a quick example of how to prepare such advice 
and use it to accept our target strings. Consider the context-sensitive language Lseg = {a"6"c" | n £ N}. It 
is obvious that L^^q is not a regular language. Let us claim that Lseg belongs to REG/n. 

Example 2.1 Consider the non-regular language L-^eq — {a"b"c^ \ n G N}. It is easy to check that L^eq 
belongs to REG/n by choosing an advice function h defined as h{n) — dp-l'^h'^l'^c^l'^ if n = (mod 3) and 
h(ri) = 0" otherwise. How can we recognize this language with advice? Consider a dfa M that behaves as 
follows. On input [ ^(p^i) ] with advice /i(|a;|), if a; = A, then accept the input immediately. Otherwise, check 
if the first bit of h(ri) is a. If so, check if a; = /i(|a;|) by moving the tape head one by one to the right. This is 
possible by scanning the upper and lower tracks at once at each cell. If the first bit of h(n) is c instead, reject 
the input. It is obvious that M accepts [ ^(j"^!) ] iff a; G Lseq- Therefore, Lseg belongs to REG/n. □ 

Another example is a context-free language Pal — [ww^ | if G {0, 1}*} (even-length palindromes), where 
denotes w in reverse. It is well-known that Pal is located outside DCFL; however, as we show below, 
advice helps Pal sit inside DCFL/n. 

Example 2.2 The non- "deterministic context-free" language Pal belongs to DCFL/n. This claim is shown 
as follows. It is well-known that the "marked" language Pal^ = {w^fw^ | u; G {0, 1}*}, where # is a center 
marker, can be recognized by a certain dpda, say, M (see, e.g., [3] for its proof). The center marker in Pal^ 
gives M a cue to switch the dpda's inner mode at the time the dpda's head moves from w to w^. More 
precisely, the dpda M stores the left substring w in its stack and, upon its cue from the marker, M checks 
whether this stack content matches the rest of the tape content. Since there is no center marker in Pal, we 
instead use an advice function h to mark the boundary between vu and vu^ in ww^'. We define /i(0) — A, 
h{n) = 0"/^~^101"/^~^ if n is even with n > 2, and h{n) = 1" if n is odd. The first occurrence of 10 in h{n) 

*In the original definition of advice functions by Karp and Lipton j5j, these functions are not necessarily computable. This is 
mainly because we are interested in how much "information" with which each piece of advice provides an underlying machine, 
rather than how to generate such information. 
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in the even case, for instance, signals the time of transition from w to w in a similar way as the dpda M 
does for Pal^. This advice places Pal in DCFL/n. □ 



3 Swapping Lemma for Regular Languages 

Our goal of this section is to develop a new form of useful lemma, which can substitute the well-known pump- 
ing lemma 1, for regular languages, even in the presence of advice. We have seen the power of advice in 
Examples 12.11 and 12.21 advice helps dfa's recognize non- "regular" languages and also helps dpda's recognize 
non- "deterministic context-free" languages. When we want to show that a certain language L, such as Equal 
and Pal, does not belong to REG/n, a standard way (stated in many undergraduate textbooks) is an appli- 
cation of the pumping lemma for regular languages. A basic scheme of the standard pumping lemma (and 
many of its variants) states that, for any infinite regular language L and any string w in L, as far as its string 
size is large enough (at least a certain constant that depends only on i), we can always pump the string w 
(by repeating a middle portion of w) while keeping its pumped string within the language L. Unfortunately, 
as discussed in Section [1] the pumping lemma is not as useful as we hope it should be, when we wish to prove 
that certain languages are located outside the advised family REG/n. To achieve our goal, we need to develop 
a different type of lemma, which we call the swapping lemma for regular languages. 
We begin with a simpler form of our swapping lemma. 

Lemma 3. 1 [Swapping Lemma for Regular Languages] Let L be any infinite regular language over an 
alphabet T, with > 2. There exists a positive integer m (called a swapping-lemma constant) such that, for 
any integer n > 1 and any subset S of L Cl S" of cardinality more than m, the following condition holds: for 
any integer i £ [0,n]z, there exist two strings x — xiX2 and y — yiy2 in S with \xi \ = \yi \ = i and \x2\ — \y2\ 
satisfying that (i) x y, (ii) yiX2 G L, and (Hi) xij/2 G L. 

Proof. We prove the lemma by a simple counting argument with use of the pigeonhole principle. Let L be 
any infinite regular language over an alphabet S. Choose a dfa M = {Q, E, 5, qo, F) that recognizes L, where 
Q is a finite set of inner states, 5 is a transition function, qo € Q is the initial state, and F C Q is a set of final 
states. We define our swapping-lemma constant m as \Q\. Let n be any integer at least 1 and let S be any 
subset of L n E" with \S\ > m. Clearly, j^l > 2. Choose an arbitrary index i S [0, n]z. If either i = or i = n, 
then the lemma is trivially true (by choosing any two distinct strings x,y in S). Henceforth, we assume that 
n > 2 and 1 < i < n — 1. Consider internal states just after scanning the ith cell. Since 15*1 > |Q|, there are 
two distinct strings x,y € S for which M enters the same internal state, say q, after reading the ith symbol 
of X as well as y. Since the dfa cannot distinguish prefi{x) and prefi{y) after reading these substrings, M 
should accept the swapped strings prefi{x)sufn-i{y) and pre fi{y)sufn-i{x). This completes the proof. □ 

Notice that, in the no-advice case, our swapping lemma can be used as a substitute for the pumping lemma 
when a target language L is not "slim" enough (i.e., |Lnl]"| > m). Let us demonstrate two simple examples 
of how to use our swapping lemma. The first example is the context-free language Pal = {ww^ \ w G {0, 1}*}. 

Example 3.2 The context-free language Pal is not in REG/n (and thus not in REG). Assume that Pal 
belongs to REG/n and apply the swapping lemma for regular languages. Since Pal e REG/n, there are a 
language L G REG over an alphabet E and an advice function h such that, for every string x G {0, 1}*, 
[ h(\x\} ] G L iff a; G Pal. Take a swapping-lemma constant m that satisfies the swapping lemma for L. Choose 
n = 2m and i = n/2. Let a subset S oi L DT," he S = {[ ^(„^ ] G L | \x\ = n}. Notice that \S\ > 2"/^ > m. 
By the lemma, there are two distinct strings x,y G {0, 1}" that force [ ^(„^ ] and [ ^^^^ ] to fall into S. By 
letting Ml = prefi{x) and U2 = prefi{y), the strings x and y are written as a; = uiuf and y = U2M^. Now, 
let us consider the two swapped strings U1U2 = prefn/2{x)sufn/2{y) and M2uf = prefn/2iy)suf„/2ix). These 
strings are clearly not of the form ww^, and thus the swapped strings [ "^^^^l^ ] and [ "^^^"^ ] cannot belong to 
L. This is a contradiction against the swapping lemma. Therefore, Pal is not in REG/n. □ 

The use of the subset S in the swapping lemma. Lemma 13.11 is of great importance in dealing with the 
advised family REG/n because, for instance, S in the above example cannot be defined as 5 = L n E" in 
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order to lead to a desired contradiction. There are also cases that require more dexterous choices of S. One 
of those cases is the non-regular language Equal — {we {0, 1}* | #o(w) = 

Example 3.3 The deterministic context-free language Equal is not in REG/n. This statement was first 
stated in [6 . Our purpose here is to apply our swapping lemma to reprove this known result. Assume that 
Equal is in REG/rt. There are a regular language L and an advice function h such that, for every binary string 
X, X G Equal iff [ ^d"^,) ] G L. Take a swapping-lemma constant m and set n — 2m as well as i — n/2. In this 
example, we cannot take a subset 5 = {[ ^j";,) ] S i | |x| = n} as we have done in Example 13.21 instead, we 
rather choose n/2 + 1 distinct strings wq, wi,W2, . . . , Wn/2 G {0, 1}", where Wk = o'=l"/2~'^0"/^^'^l'' for each 
index k e [0, n/2]z, and we then define = |[ ],...,[ ]|. Clearly, the cardinality \S\ is more than 
TO. The crucial point of the choice of Wk's is explained as #o(p^6/n/2(^'^fc)) ~ ^ for number k G [0, n/2]^. 
The swapping lemma provides two distinct strings x = wj and y — Wk (j ^ k) such that [ ^^^^ ], [ ^^^j ] e S 
and [ h"?!) ], [ h{i.) ] S L, where the swapped strings ui and U2 are of the form ui = prefn/2{x)sufn/2{y) and 
"2 — prefn/2{y)sufn/2{x)- It easily follows that 

Ti Tl 

#o(wi) = #Q{prefn/2ix)) + #o(sw/«/2(y)) =-^'+2^^^2 
since j ^ k. This contradicts the result that [ ^"J.) ] G L. Therefore, Equal cannot be in REG/n. □ 

Next, we present a more general form of our swapping lemma. In Lemma I5TT1 two strings x and y are both 
split into two blocks and one of these blocks is used for swapping. In the next lemma, in contrast, these strings 
are split into any fixed number of blocks, one of which is actually used for swapping. This form is useful when 
we want to show that, for instance, the non-regular language GT — {w E {0, 1}* | #o(w) > #i(w)} does not 
belong to REG/n. 

Lemma 3.4 [Swapping Lemma for Regular Languages] Let L be any infinite regular language over an 
alphabet E with |E| > 2. There is a positive number to such that, for any number n > 1, any set S C L H E", 
and any series {ii,i2, ■ ■ ■ ,ik) G ([Ij'^lz)'^ with *i — certain number k G [l,n]z, the following 

statement holds. If \S\ > m, then there exist two strings x = xiX2 ■ ■ ■ Xk+i and y = yiy2 ■ ■ ■ Vk+i 'in S with 
\xk+i\ = \yk+i\ and \xj>\ — \yj'\ = ij' for each index j' G [l,fc]z such that, for every index j G [l,k]j,, (i) 
x^y, (a) xi ■ ■ -Xj-iyjXj+i ■ ■ ■ Xk+i G L, and (Hi) yi ■ --yj^ix^yj+i ■ --yk+i G L. 

Proof. Note that, when k = 1, this lemma is indeed Lemma [3.11 Consider a dfa M — {Q, E, S, qo, F) that 
recognizes L. Let S C L n E" satisfy \S\ > m, where to = IQ]*"'. To each string s G S", we assign a fc-tuple 
52, ■ ■ ■ , qk) of internal states of M such that, for each j G [1, k]z, M enters state qj after scanning the 
(^g^-^Ze)th cell. There are at most \Q\^ such tuples. Since \S\ > \Q\'', there are two distinct strings x,y in 
S such that they correspond to the same series of internal states, say (qi, q2, . . . , qk)- Write x — xiX2 ■ ■ ■ Xk+i 
and y = yiy2 - ■ - yk+i, where = \yk+i\ and \xj' \ = \yj' \ = ij> for every index j' G [l,fc]z. Notice that, 

for each index j G [l,A:]z, M enters the same internal state qj after scanning Xj as well as yj on the inputs 
X and y, respectively. Fix an index j G [1, fc]z arbitrarily. From the choice of x and y, we can swap the two 
blocks Xj and yj in x and y without changing the acceptance condition of M . Therefore, the swapped strings 
xi • • ■ Xj-iyjXj^i ■ ■ • Xk+i and yi ■ ■ • yj-iXjyj+i ■ ■ • yk+i are both accepted by M. This gives the conclusion of 
the lemma. □ 
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Figure 1: A string w^^-* with j = 3 and m = 7 
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Let us demonstrate how to apply Lemma 13.41 to the deterministic context-free language GT. 

Example 3.5 The deterministic context-free language GT is not in REG/n. Assuming that GT e REG/n, 
we choose an advice function h and a regular language L over an alphabet E such that, for every binary 
string X, X & GT iff [ ^(^^|, ] G L. Since L is an infinite regular language, we can apply Lemma 13.41 to L. 
Let m be a swapping-lemma constant. Without loss of generality, we can assume that m is odd and at least 
3. Define n = and we are focused on the set L n S". Let (ii,i2, . . . ,«m-i) be a unique series defined 
by ij — m for every index j g [l,m — 1]^. This series makes each n bit-string partitioned into m blocks of 
equal size m. For each index j G [l,m]z, let w^^' denote the string of the following form: (i) 

the jih block w^''-' equals 01™^^ and (ii) any other block w^''^ equals 0™ , where m' = [m/2j. Figure 

[T] gives an example of w*^^^ when j = 3 and m = 7. Since #o(w*'^-') = 4j=i{w'^^'>) + 1, this string w'--'^ belongs 
to GT. The desired set 5 is thus defined as {[ ], • ■ • j [ '°h(2') ]}• Clearly, |5| > m. By Lemma [Ol there 
are two distinct strings w^'^^ and w^'^ [k ^ I) such that [ ], [ ] G 5* and [ ], [ ] G L, where 
w'^^^ and are obtained respectively from w'^^^ and w*^'^ by swapping their Zth blocks. Notice that, for each 
I G {0,1}, = - Since #oH^'^) = + 1, #o(?i'f') = 1, and 

#i(u';''') = m — 1, it immediately follows that #i(?Z>'-'^-') = #o{w''''^) + m — 2, which is greater than ^o{w''''^). 
This implies that li'-'^^ ^ GT, contradicting the conclusion of Lcmma[2Hl Therefore, GT cannot be in REG/n. 
□ 

From Examples 13.31 and 13.51 since Equal and GT are both deterministic context-free, we can obtain a 
separation between DCFL and REG/n. This gives a strong separation between REG/n and DCFL/n, because 
REG/n = DCFL/n implies DCFL C REG/n (using the fact that DCFL C DCFL/n). 

Proposition 3.6 DCFL ^ REG/n. Or eqmvalently, REG/n 7^ DCFL/n. 

4 Swapping Lemma for Context-Free Languages 

We have shown the usefulness of our swapping lemma for regular languages by proving that three typical 
languages cannot belong to REG/n. Now, we turn our attention to CFL/n, the family of context-free languages 
with advice. A standarcJ3 pumping lemma for context-free languages helps us pin down, for example, the 
language Dup = {ww \ w £ {0, 1}*} (duplicating strings) into the outside of CEL. As in the case of regular 
languages, this pumping lemma is also of no use when we try to prove that Dup is not in CFL/n. This 
situation urges us to develop a new form of lemma, the swapping lemma for context-free languages. 

To state our swapping lemma, we introduce the following notation for each fixed subset S of S". For 
any two indices i,j G [l,n]z with i + j < n and any string w G S-*, the notation Si^u denotes the set 
{x S \ u = middi^i^jlx)}. It thus follows that S — IJ«gsj "^^^ ^'^^ each fixed index j G [l,n]z. 

Lemma 4-1 [Swapping Lemma for Context-Free Languages] Let L be any infinite context-free language over 
an alphabet E with \T,\ > 2. There is a positive number m that satisfies the following. Let n be any positive 
number at least 2, let S be any subset o/LnS", and let jo, k G [2, n]z be any two indices satisfying that k > 2jo 
and \Si^u\ < |S'|/m,(fc — jo + l)(n — jo + 1) for any index i G [1, n — j'o]z and any .string u G . There exist 
two indices i G [1, n]z and j G [jo, fcjz with i + j < n and two strings x = X1X2X3 and y = ^12/22/3 in S with 
\xi\ = \yi\ = i, \x2\ = \y2\ = j, and \x3\ = {y^l such that (i) X2 ^ 2/2, (n) Xxy2X-i G L, and (Hi) yiX2y3 e L. 

The above form of our swapping lemma is similar to Lcmma l3.41 however, we can no longer choose a pair 
{i,j) at our will. Moreover, the cardinality of a subset S must be much larger than that in the case of regular 
languages. Since the proof of this lemma is more involved than that of Lemma 13.41 we postpone it until the 
next section. 

Meanwhile, we see how to use the swapping lemma for context-free languages. First, it is not difficult to 
show that Dup does not belong to CFL/n by applying the lemma directly. 

tThere are several well-known variants of pumping lemma for context-free languages. Ogden's lemma [7| is such a variant. 
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Example 4.2 The language Dup is not in CFL/n (and thus not in CFL). Let us assume that Dup G CFL/n 
to lead to a contradiction. First, choose an advice function h and a context-free language L such that, 
for any binary string x, x £ Dup iff [ ^(^^^ ] G L. Let m be a swapping-lemma constant for L. Second, 
choose any sufficiently large even number n satisfying that 2"^^ > 2mn^. Now, let us define a subset S 
as S* = {[ ^j'^jj) ] G L I |x| = n}. It suffices to satisfy the condition that \Si^u\ < \S\/kmn for any index 
i e - jo]z and any string m G S^". Since j^l = 2"/^ and |5'^,„| = 2"/2-l''l for any string u G I]^"/^^ we 
need to set k = n/2 and jo = [log2 mn^] + 1. By the swapping lemma for context-free languages, there are 
two indices j G [jo: fc]z and i G [1, n — j]-^ and two strings x = X1X2X3 and y = 2/12/22/3 in with \xi \ — \yi\ = i, 
2^2! = I2/2I = j, and l^al = jj/aj such that (i) 2:2 7^ 2/2: (ii) xiy2X^ G L, and (iii) 2/1X22/3 G There are three 
cases to consider: (a) i + j < n/2, (b) i < n/2 < i + j, and (c) n/2 < i. Let us consider Case (a). Since i >1 
and 2 < j < n/2, both X2 and 2/2 are respectively in the first half portion of x and y. Therefore, it is obvious 
that the swapped strings xiy2X3 and 2/12^22/3 are not of the form [ ^^J™) ]. This is a contradiction. The other 
cases are similar. Therefore, Dup cannot be in CFL/n. □ 



In the above example, the choice of k is crucial. For instance, when fc = n/2 + 1, there is a case where we 
cannot lead to any contradiction. Consider the following case. Take two strings x — X1X2X3 and y = 2/12/22/3 
satisfying that xi = yi — 0"/^~^, X3 — ys = l"/^, and X2 = OX3X1O, and 2/2 — l2/32/il- Clearly, x and y are in 
L n {0, 1}" and the swapped strings xiy2X^ and 2/12^22/3 are also in L. 

Our next example is a slightly artificial language EqualQ, which consists of all strings w over the alphabet 
S — {ai, a2, . . . , ag, #} such that each symbol except # in E appears the same number of times in w; that is, 
4l^a{w) — 4f^b{w) for any pair a, 6 G S — Note that the complement Equals is in CFL. This containment 

is shown by considering an npda that behaves as follows: on input w, choose two distinct symbols, say a and 
6, in S — {#} nondeterministically and check if #a(w) 7^ 4/^b{w). In other term. Equals is in co-CFL. On the 
contrary, we can show that Equals cannot belong to CFL/n. 
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Figure 2: A form of w with ^ai{w) = 4 and #^(w) = 24 when n = 48 



Example 4.3 The language Equals is not in CFL/n. Assuming that Equals G CFL/n, we choose an advice 
function h and a language L G CFL such that, for every string x G E*, x G Equals iff [ ] G L. Since 
L G CFL, take a swapping-lemma constant rn for L. Let n = 864to. For each symbol in E, we use 
the notation for the string (ai)'^#"/^^~'^ of length n/12. As a special case, we have a(i,o) = 

and a(i_„/i2) = (aO"/i^ For convenience, we denote by W(ej e2,....e6) the string made up by the following 
6 blocks: a(i^ei)a(2.e2) ' ' ■'^(e.ee)- Lst 5* be the set consisting of all strings [ ], where w is of the form 
W(ei,...,ee)'^in/i2-ei,...,n/i2-ee) fo^' any six indiccs 61,62, . . . , 65 G [0,?i/12]z. An example of such w is shown in 
Figure [21 Notice that, for any symbol a G E — {#}, if w G S* then #a(w) — n/12. Moreover, ^^(w) — n/2. 
Now, we choose jo ~ n/4 and k = n/2. A simple observation gives \S\ — {n/12 + 1)^. Let u be an arbitrary 
string in E-'". To estimate |S'i,„|, we note that \Si^u\ < |'5'i^#n/4| for any index i G [l,n — jo]z- This gives a 
simple upper bound: < (n/12 + 1)^. Obviously, since n — 864rn, we have 

in I , ^ / '^^^ m(n + 12)^ /n \6 

5i„-fcmn< hi = ^ ^< hi ^\S\. 

' ' ' - V12 / 2 3456 V12 / ' ' 

The swapping lemma provides an index pair i,j with n/4 < j < n/2 and i + j < n and a string 
pair x,y G E" with middi^i+j{x) ^ middi^i+j{y) such that [ ^f^, ], [ ft(''„) ] G S and [ ^"1, ], [ ^l, ] G 
L , where x' and 2/' are two swapped strings defined as x' = prefi(x)middi^i+j{y)sufn-i-j(x) and 
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y' ~ prefi{y)middi^i^j{x)sufn-i-j{y)- Since the substrings middij^j{x) and middi^i+j(jj) have length 
j, middi^i+j{x) ^ middi^i+j[y) imphes that ^aimiddi^i^jlx)) / ^ai'm-iddi^i+j{y)) for a certain symbol 
a e E — {#}. Hence, we conclude that ^a{x') ^ n/\2. This is a contradiction against the statement 
that [ ^^^j ] e L. Therefore, Equals cannot be in CFL/n. □ 

Since the language Equals belongs to co-CFL, as shown in Example 14. 31 we can derive the following strong 
separation between CFL/n and co-CFL/n. 

Proposition 4-4 co-CFL ^ CFL/n. Or equivalently, co-CFL/n 7^ CFL/n. 

The second part of the above proposition follows from the fact that co-CFL C co-CFL/n. This proposition 
also indicates that CFL/n is not closed under complementation. 

5 Proof of Lemma 14.11 

As we have seen in Examples 14.21 and 14.31 the swapping lemma for context-free languages are a powerful tool 
in proving non-context-freeness with advice. This section will describe in detail the proof of Lemma l4.11 The 
proof requires an analysis of a stack's behavior of a nondeterministic pushdown automaton (or an npda) . 

5.1 Nondeterministic Pushdown Automata 

Although there are several models to describe context-free languages, here, we use the machine model of 
npda's. We first review certain facts regarding the npda's. Let L be any infinite context-free language over 
an alphabet S with |S| > 2. Since Lemma [4.11 targets only inputs of length at least 2, it is harmless for us 
to assume that L contains no empty string A. Now, consider a context-free grammar G = (V, T, 5*, P) that 
generates L with T = E, where y is a set of variables, T is a set of terminal symbols, S ^ is the start 
variable, and P is a set of productions. Without loss of generality, we can assume that G is in Greibach 
normal form; that is, P consists of the production rules of the form A — > au, where A £ a S S, and 
u G y*. A process of transforming a context-free grammar into Greibach normal form is described in many 
undergraduate textbooks {e.g., [4]). 

Closely associated with the grammar G, we want to build an npda M = (Q, S, F, 6, qo, z, F), where Q is 
a set of internal states, F is a stack alphabet, (5 is a transition function, qo ^ Q is the initial state, z G F is 
the stack start symbol, and F C Q is a set of final states. For our npda M, we define Q = {qo, qi, q/}, z ^ V, 
and T = V U {z}, F = {qf}. To make our later argument simpler, we include two special end-markers ^ and 
$, which mark the left end and the right end of an input, respectively. Hereafter, we consider only inputs 
of the form ^x$, where a; G E*, and we sometimes treat the endmarkers as an integrated part of the input. 
Notice that \^x$\ = \x\ + 2. For convenience, every tape cell is indexed with integers and the left endmarker ((; 
is always written in the 0th cell. The input string x of length n is written in the cells indexed between 1 and 
n and the right endmarker $ is written in the n -f 1st cell. 

When we express the content of the stack of M as a series s = S1S2S3 ■ ■ ■ Sm, we understand that the 
leftmost symbol si is located at the top of the stack and the Sm is at the bottom of the stack. At last, the 
transition function 5 is defined as follows: 

(1) S{qo,^,z) ^ {{qi,Sz)}; 

(2) 5{qi, a, A) ~ {{qi,u) \ u V* , P contains A au} for every a G S and A ^ V: and 

(3) = 

It is important to note that M is always in the internal state qi while the tape head scans any cell located 
between 1 and n. Note also that, during an accepting computation, the stack of the npda M never becomes 
empty because of the form of production rules in P. Therefore, we can demand that S should satisfy the 
following requirement. 

(4) For any symbol a G S, 6{qi,a, z) — 0. 
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Additionally, we modify the npda M and force its stack to increase in size by at most two by encoding 
several consecutive stack symbols (except for z) into one new stack symbol. For instance, provided that 
the original npda M increases its stack size by at most 3, we introduce a new stack alphabet F' consisting 
of (vi), {V1V2), and {V1V2V3), where vi,V2,V3 e F. A new transition 6' is defined as follows. Initially, we 
define S'{qo,(!^,z') = {{qi,S'z')}, where S' = {S) and z' — (z). Consider the case where the top of a new 
stack contains a new stack symbol {V1V2V3), which indicates that the top three stack symbols of the original 
computation are V1V2V3. If M applies a transition of the form ((71,^1 W2W3) S S{qi,a,vi), then we instead 
apply {qi, {wiW2){w3V2V3)) e S'{qi,a,{viV2V3)). In case of (gi,A) £ d{qi,a,vi), we now apply (gi,(f2f3)) G 
S'{qi,a, {V1V2V3)). The other cases of S' are similarly defined. See, e.g., |4j for details. 

Overall, we can assume the following extra condition. 

(5) for any a G E, any w G F, and any w £ F*, if {qi,w) e (5(gi, a, w), then \w\ < 2. 

The aforementioned five conditions significantly simplify the proof of Lemma l4.1l In the rest of this paper, 
we assume that our ndpa M satisfies these conditions. For each string x ^ S, we write ACC{x) for the set of 
all accepting computation paths of M on the input x. Moreover, let ACCn = [J^^g ACC{x). 



5.2 Stack Transitions, Intervals, and Heights 

For the proof of Lemma |4. 11 we wish to present our key lemma. Lemma |5. II To describe our lemma, we need 
to introduce several necessary notions and notations. An intercell boundary i refers to a boundary or a border 
between two adjacent cells — the ith cell and the i + 1st cell — in our npda's input tape. We sometimes call 
the intercell boundary —1 the initial intercell boundary and the intercell boundary n + 1 the final intercell 
boundary. Meanwhile, we fix a subset S C LnE", a string x in 5*, and a computation path p of M in ACC{x). 
Along this path p, we assign to intercell boundary i a stack content produced after scanning the ith cell and 
before scanning the i + 1st cell. For convenience, such a stack content is referred to as the "stack content at 
intercell boundary i." For instance, the stack contents at the initial and final intercell boundaries are both 
z, independent of the choice of accepting paths. Figure [3] illustrates intercell boundaries and a transition of 
stack contents. 
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Figure 3: An example of intercell boundaries and stack contents 



An accepting computation path of the npda generates a length-(ri + 2) series (s_i, sq,si, . . . , s„, Sn+i) of 
stack contents with s_i = s„ = Sn+i = z and sq = Sz. We refer to this series as a stack transition associated 
with the interval /q = [^1,?^ + More generally, when I = [io,ii]z is a subinterval of Jq, we call an 
associated subsequence 7 = (si^, Si^+i, . . . , Si^) a stack transition with the interval /. We define the height at 
intercell boundary 5 of 7 to be the length \sb\ of the stack content st at b. By our choice of the ndpa Af given 
in Section [5.11 the minimal height is 1. 

For our purpose, we hereafter focus our attention only on stack transitions 7 with intervals [ioiiilz in 
which (i) we have the same height £ at both of the intercell boundaries io and ii and (ii) all heights within 
this interval are more than or equal to £. We briefiy call such 7 ideal. 

Let / = [«o;*i]z be any subinterval of Jq and let 7 = (si„, Si^+i, . . . , s^j) be any ideal stack transition 
with this interval /. For each possible height £, we define the minimal width, denoted minwidj {£) (resp., the 
maximal width, denoted maxwidj {£)) , to be the minimal value (resp., maximal value) |/'| = i[ — Iq for which 
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(i) /' — [zq, C /, (ii) 7 has height I at both intercell boundaries Iq and and (iii) at no intercell boundary 
i ^ I' , ^ has height less than ^. Such a pair (ig, i'^) produces a subsequence 7' = (s^^, Sij^+i, . . . , s^'^) of 7. In 
such a case, we say that 7' realizes the minimal width 'minwidi{£) (resp., maximal width maxwidi{£)). 

We say that a stack transition 7 has a peaA; ai i if |si_i| < \si\ and |si_|_i| < \si\. Moreover, 7 has a /Zai 
peofc m if < |sj;,| = = • ■ • = |si'J and \si'^+i\ < js^'J. On the contrary, we say that 7 has 

a 6ase at i if |si_i| > \s,\ and |sj+i| > |si|; 7 has a flat base in {i'o,i[) if |sj^-i| > = = • • ■ = |si'J 

and |si'^+i| > |si'J. See Figure S] for an example of (flat) peaks and (flat) bases. 




Figure 4: An example of track transition with interval / = [iq, ii]z and height £ 



At last, we state our key lemma, which holds for any accepting computation path p without any assumption 
other than jo > 2 and 2jo < k < n. 

Lemma 5. 1 [key lemma] Let 5* C L n S", let x be any string in S , and let p be any computation path of M 
in ACC{x). Assume that jo > 2 and 2jo < k < n. For any interval I = [io,ii]z Q [^Ij?^ + l]z with \I\ > k 
and for any ideal stack transition 7 with the interval I having height £0 at the two intercell boundaries iq and 
ii, there are a subinterval I' = [i'o,i'i\z of I and a height £ e such that 7 has height £ at both intercell 

boundaries Iq and i'-y, jq < \I'\ < k, and minwidi{£) < \I'\ < maxwidi{£). 

Proof. Fix six parameters {x,p,"/,£o,I) given in the premise of the lemma. We prove the lemma by 
induction on the number of peaks or flat peaks along the computation path p of M in ACC{x). 

(Basis Case) In this particular case, there is either one peak or one flat peak in 7 in the interval / — [io, ii\z- 
First, we consider the case where there is a peak. Let £1 be the height of such a peak. Note that minwidj{£) = 
maxwidi {£ + 1) + 2 for any height £ with £0 < £ < £1, because of the condition 5 provided for the npda's 
transition function S. Now, let £' be the maximal height satisfying that minwidi{£' + 1) < jo < minwidi{£'). 
Such £' exists because |/| > fc > jo- Let Imin = [*oi*i]z be the minimal interval such that 7 has height £' + 1 
at the two intercell boundaries i'o and i'l. Similarly, let Imax = [*0' *i]z be the maximal interval that satisfies a 
similar condition with £' + 1, i'o, and i". If jo = minwidi{£' + 1), then we choose the desired interval /' = Imin 
and the height £ — £' + 1 ior the lemma. If jo < maxwidi {£' + 1), then we pick an interval /' satisfying that 
Imin C /' C Imax and |/'| ~ jo- Wc also define £ = £' + 1 for the lemma. The remaining case to consider is 
that maxwidi{£' + 1) < jo < rninwidi{£'). In this case, it follows that 

jo < minwidi{£') — maxwidi{£' + 1) + 2 < jo + 2 < 2jo < k 

since jo > 2. Let I'min = [*Oj*i]z be the minimal interval such that 7 has height £' at the two intercell 
boundaries io and ii. It is thus enough to define /' = and £ = £' for the lemma. 

Next, we consider the case where there is a fiat peak in (12,^3) with height £1. If 13 — 12 > jo, then we 
choose /' = [12,^2 + jo]z and £ = £1 for the lemma. The other case where is — i2 < jo is similar to the "peak" 
case discussed above. 

(Induction Step) First, let c > 1 and consider the case where there are c peaks and/or flat peaks in the 
given interval / = [io:*i]z with |/| > fc. Choose the lowest base or flat base within this interval. If we have 
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more than one such base and/or flat base, then we always choose the leftmost one. Now, consider the case 
where there is the lowest base at Z2 and let £2 be the height at 12. Since 7 is an ideal stack transition, we have 
£2 > £o- Let I* = [i'Q,i[]z be the largest interval for which the height at both and i[ equals £2- The choice 
of /* implies that |/*| = maxwidi{£2)- We split /* into two subintervals Ii = [iQ,i2]z and I2 = [i2,i'i\z- If 
jo < 1-^*1 < k, then we set I' = /* and £ = £2 for the lemma. If |/*| < jo, then a similar argument used for the 
basis case proves the lemma. Now, assume that |/*| > k. Since k > 2jo, either one of of Ii and I2 has size 
more than jo- We pick such an interval, say I3. Let 7' be a unique subsequence of 7 defined in the interval I3. 
If I/3I < k, then we choose /' = I3 and £ = £2 for the lemma. Let us assume that I/3I > k. By the choice of I3, 
7' is an ideal stack transition. Since 7' has fewer than c peaks and/or flat peaks, we can apply the induction 
hypothesis to obtain the lemma. 

Next, we consider the other case where there is the lowest flat base in (12, ^s). We define /* = [io,*i]z 
as before. Unlike the previous "lowest base" case, we need to split /* into three intervals Ii — [jo,i2]z, 
I2 — [*2, «3]z, and I3 — [is, i'-^i- If either |/*| < jo or jo <\I*\ < k, then it suffices to apply a similar argument 
used for the "lowest base" case. Now, assume that |/*| > k. Since k > 2jo, either one of the two intervals 
Ii U I2 and I3 has size more than jo. We pick such an interval. The rest of our argument is similar to the one 
for the "lowest base" case. □ 



5.3 Technical Tools 

Let M — {Q,I],r, d,qo, z, F) be our npda for L defined in Section [5TT1 We have already seen fundamental 
properties of our npda in Section 15.21 Now, let us begin proving Lemma 14.11 by contradiction. First, we set 
our swapping-lemma constant m to be jFp and assume that the conclusion of Lemma |4. II is false for this to; 
that is, the following assumption (a) holds for four fixed parameters (n,jo,fc, S') given in the premise of the 
lemma. We fix these parameters throughout this subsection and its subsequent subsection. 

(a) There are no indices i G [1, n]z and j € [jo, k]z with i + j < n and no strings x — X1X2X3 and y = 2/1^22/3 
in S wfth \xi\ = \yi\= i, |a;2| = \y2\ = j, and {xsl = lys] such that (i) X2 ^ 2/2, (u) xiy2X3 S L, and (iu) 

yiX2y3 e L. 

Recall that S" is a fixed subset of L n S". Meanwhile, we fix additional five parameters x G S, j G [jo, fc]z, 
is [l,n — j]z, w g r, and p € ACC'{x). As a technical tool, we introduce the notation Gij^p(x : v). Roughly 
speaking, Gij^p(x : v) expresses a part of stack content that is newly produced from its original content vs 
during the head's scanning the cells indexed between i + 1 and i+j, provided that the npda scans no symbol in 
s. Note that, when the npda is deterministic, the information on p can be discarded, because p is completely 
determined from x. More precisely, Gij^p{x : v) denotes a unique string t G T* (if any) that satisfies the 
following three conditions, along the computation path p with the input x. 

1. The stack consists of vs at the intercell boundary i, where s £ F*. 

2. At the intercell boundary i+j, the stack consists of ts. 

3. While the head scans any cell indexed between i + 1 and i+j, the npda never accesses any symbol in 
s; that is, no transition of the form {qi, w) G S{qi, a, r), where r is a symbol in s, is applied. 

With the fixed parameters {i,j,v,t,p) described above, we use the notation T-^ltp denote the set 
{x G 5 I Gij,p{x : v) — t}. A crucial property of Tj^l ^ ^ is stated in the following lemma. 

Lemma 5.2 We fix j G [jo,fc]z, * G [l,n — j]z, p,p' G ACCn, w G F, and t G F*. For any two strings 
x,y in S, if x & T^^ltp '^'^'^ V ^ "^fltp'' then two swapped strings prefi{x)middi^i+j{y)sufn-i-j{x) and 
prefi{y)middi ijf.j{x)sufn^i-j{y) are both in L. 

Proof. Assume that the npda's stack consists of vs (resp., vs') at an intercell boundary i along an accepting 
path p (resp., p') on an input x (resp., y). Since x G Tj^l^^ ^ (resp., y G T^j^ltp')^ npda generates a stack 
content ts (resp., ts') at the intercell boundary i + j. Note that, during the head's scanning the cells indexed 
between i + 1 and i + j, the npda never accesses s (resp., s') along the path p (resp., p'). Therefore, we can 
swap two substrings middi^i+j{x) and middi^i+j{y) written in the cells indexed between i + 1 and i+j in x and 
y, respectively, without changing the acceptance condition of the npda. Therefore, the npda accepts the two 
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stimgs pre fi{x)'middi^i+j {y)sufn-i-j (x) and pre fi{y)middi^i+j {x)sufn-i-j (y) ■ This impHes the conclusion of 
the lemma. □ 



Recall from Section 2] the notation Si^^j- Now, we consider the following statement, 
(b) There are no j G [jo,k]z, i € [17"- ~ j]z, P,p' G ACCn, t e F*, or u,w e S-' such that u ^ w, 

T-!lt,p n ^^,n ^ 0, and r« n ^ 0. 

Lemma 5.3 The statement (a) implies the statement (h). 

Proof. Assume the statement (a). To show the statement (b), let us assume on the contrary that the 
statement (b) does not hold. This means that certain parameters (i,j,p,p',t,u,'w) satisfy the following 
conditions: u ^ w, T^^l^t,p ^ 0; and T^^^ ty n Si^^ 7^ 0- Now, we take two strings x € T^%w,p ^ ^i,^ ^'^'^ 

y G T^''l ^ p,C\Si^w Notice that |m| = |w| = j. Since x G Si^u and y G 5;.^,, it follows that u = middi^i+j{x) and 
w — middi_i+j{y). Lemma [5.21 then implies that the swapped strings x' — prefi{x)middi^i+j{y)sufn-i-j{x) 
and y' — prefi{'y)middi^i+j{x)sufn-i^j{y) are both in L. This contradicts the statement (a). Therefore, the 
statement (b) holds. □ 

From Lemmas 15.21 and 15.31 the choice of accepting paths for strings in S is of little importance. Hence, for 
later convenience, we write Tj^l ^ to denote the union UpG ACC„ '^j^v,t,p- 

5.4 Closing Argument 

Under our assumption that Lemma |4. II is false, we want to lead to a desired contradiction, which immediately 
proves the lemma. To achieve our goal, we utilize Lemma 15.11 given in Section 15.21 Notice that, by Lemma 
15.31 the statement (b) now holds. Recall also that three parameters {n,jo, fc, S) are fixed through our proof. 

For our convenience, we write A for the index set {(i, j, v, w) | j G [jo, k]z, i G [1, n — j]z, v,w £ F}. The 
cardinality of this set A is equal to 

I A| = {k- jo)(n - jo + l)|r|2 = m{k - 30 + jo + 1) 

since m = |Fp. We want to assign each string a; in 5 to a certain element (i, j, v, w) in A. For this purpose, 
we first show the following lemma, which can be obtained immediately from Lemma |5. 11 

Lemma 5.4 Assume that the statement (a) holds. Let jo and k satisfy that jo > 2 and 2jo < k < n. For 
any string x £ S, there is an element {i,j, w, G A for which x G T^*] ^■ 

Proof. Let x be any string in S and consider any computation path p of the npda M in ACC{x). We 
denote by 7 the stack transition of M with the interval /o = [— l,n+ l]z. Lemma 15.11 guarantees, in the 
stack transition 7, the existence of a subinterval /' = [jqi^'iIz and a height i satisfying that jo < |/'| < fc, 
minwidi{£) < \I'\ < maxwidi{£), and 7 has height £ at both of the intercell boundaries and i'^. For our 
desired index values i and j, we define them as i — iQ and j = |/'| (= i'l — io)- Now, let us consider the changes 
of stack contents while Af 's head scans through the interval /'. Let vs be the stack content at the intercell 
boundary Iq and let ws' be the stack content at i[, where v,w €T and s, s' G F*. Note that £ = \vs\ = \ws'\. 
Since all heights in /' arc at least £, within this interval /', M does not access any symbol in s; hence, we 
conclude that s — s' . This implies that Gij^p{x : v) equals w. Therefore, x should be in Tj^^ ^. □ 

Lemma 15.41 establishes a key association of strings in S with elements in A. Using this association, we 
introduce a map e from 5 to A. For each string a; in S*, assuming an appropriate lexicographic order for 
A, we denote by e{x) the minimal element {i,j,v,w) G A satisfying that x G T^^l ^ Notice that this 
minimality requirement makes e{x) uniquely determined from x. With this map e, we define Aij^y^^ as the 

set {a; G 5" I e{x) = (i, j,v,w)}. Obviously, it follows that Ai^j,y_w Q T^^l w- 
Now, we claim the following property of the map e. 

Claim 1 There exist two strings x,y S and also two strings u, z G such that u ^ z, x E Si_u, y G Si^z, 
and e{x) = e{y) = {i,j,v,t). 
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If this claim is true, then we take four strings {x, y, u, z) given in the claim. Since e{x) = e{y) = {i, j, w, t), 

we obtain x,y ^ ^i.j,v,w Since ^ij>,tu C it follows that x G Tj'l n; n Si^u and y £ T^j'l w n Si^^- This 

is obviously a contradiction against the assumption (b) and hence the assumption (a). Therefore, Lemma |4. II 
should hold. 

What remains undone is the verification of Claim [1] Let us prove the claim. Since e(-) is a map from S to 
A, there is a certain element (i,j,v,w) G A satisfying that > |S'|/|A|. Fix such an element (i,j,v,w). 

For any string u G E-', since |A| = m{k — jo + l)(n — jo + 1), we obtain 

w(/c -Jo + !)("-- Jo + 1) |A| 

where the first inequality is one of the premises of Lemma 14.11 Since 5* = Umsej ^i,u^ the above inequality 
concludes that there are at least two distinct strings u, z G S-' for which certain strings x S Si,u and y E St^z 
map to the same element {i,j,v,t). This completes the proof of the claim and therefore completes the proof 
of Lemma 14.11 



Acknowledgments. The author thanks Satoshi Okawa and Francois Le Gall for a helpful discussion on a 
fundamental structure of context-free languages. 
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